Supervised learning with Hidden Markov Models has been used to train acoustic models for automatic speech recognition for several years. Typically, clean transcriptions form the basis for this training regimen. However, results have shown that using readily available sources of transcriptions, which can at times be erroneous (e.g., closed captions), does not degrade performance significantly. This work analyzes the effects of mislabeled data on recognition accuracy. For this purpose, training is performed using manually corrupted training data, and the results are observed on three different databases: TIDigits, Alphadigits, and SWITCHBOARD. For Alphadigits, with 16% of the data mislabeled, system performance degrades by 12% relative to the baseline. For a complex task such as SWITCHBOARD, with 16% of the training data mislabeled, system performance degrades by 8.5% relative to the baseline. The training process is robust to mislabeled data because the Gaussian mixtures used to model the underlying distribution tend to cluster around the majority of the correct data; the outliers (incorrect data) do not contribute significantly to the reestimation process.
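The robustness argument in the last sentences can be illustrated with a minimal sketch, not taken from the paper itself: a toy 1-D data set in which 16% of the samples for a class are actually drawn from a different class, fitted with a two-component Gaussian mixture via plain EM. The dominant component locks onto the majority of the correct data, while the mislabeled outliers are absorbed by a low-weight component and barely influence its reestimated mean. All distribution parameters and the mislabeling rate here are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy class data: 84% "correct" samples near 0, 16% mislabeled samples near 5
# (16% mirrors the corruption level studied in the abstract; the Gaussians
# themselves are arbitrary choices for illustration).
clean = rng.normal(0.0, 1.0, 840)
mislabeled = rng.normal(5.0, 1.0, 160)
x = np.concatenate([clean, mislabeled])

# Fit a 2-component 1-D Gaussian mixture with plain EM.
means = np.array([x.min(), x.max()])   # spread initialization
vars_ = np.array([1.0, 1.0])
weights = np.array([0.5, 0.5])

for _ in range(100):
    # E-step: responsibility of each component for each sample
    dens = (weights / np.sqrt(2.0 * np.pi * vars_)
            * np.exp(-0.5 * (x[:, None] - means) ** 2 / vars_))
    resp = dens / dens.sum(axis=1, keepdims=True)
    # M-step: reestimate weights, means, variances from soft counts
    nk = resp.sum(axis=0)
    weights = nk / len(x)
    means = (resp * x[:, None]).sum(axis=0) / nk
    vars_ = (resp * (x[:, None] - means) ** 2).sum(axis=0) / nk

dom = int(np.argmax(weights))
print(f"dominant component: mean={means[dom]:.2f}, weight={weights[dom]:.2f}")
print(f"outlier component:  mean={means[1 - dom]:.2f}, weight={weights[1 - dom]:.2f}")
```

The dominant component ends up with weight close to 0.84 and a mean near the correct class center, while the mislabeled samples are captured by the minority component: the same clustering effect the abstract credits for the robustness of HMM reestimation.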